Supporting information S1 Text for Modeling genetic interactions using generalized linear models
نویسندگان
چکیده
A generalized linear model describes the distribution of random variable conditioned on one or more other random variables. The generalized linear model consists of two components: a distribution f(y) in the exponential family and a link function g that maps the linear predictor to the mean parameter of the distribution. Let yi for individual i ∈ {1...n} be the random variable that we want to model conditioned on a set of covariates xij for j ∈ {1...m}. Let X denote the matrix of covariates, and xi the vector of covariates corresponding to individual i. Vectors are denoted by bold face. The expected value of the outcome μi = E[Yi] for individual i is described as a function of the covariates g(μi) = xiβ = ηi, where we introduce ηi to simplify notation later. Because the distribution belongs to the exponential family, the likelihood of the model can be written on the following form
منابع مشابه
Supporting information Text S1 for Unifying Viral Genetics and Human Transportation Data to Predict the Global Transmission Dynamics of Human Influenza H3N2 Contents
1 Materials and Methods 1 1.1 Sequence and location data . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Incorporating uncertainty in air community assignment . . . . . . . . . . . . 3 1.3 Bayesian statistical analysis of sequence and trait evolution . . . . . . . . . . 4 1.3.1 GLM diffusion implementation and predictor support . . . . . . . . . 5 1.3.2 GLM diffusion predictors . . ...
متن کاملDust source mapping using satellite imagery and machine learning models
Predicting dust sources area and determining the affecting factors is necessary in order to prioritize management and practice deal with desertification due to wind erosion in arid areas. Therefore, this study aimed to evaluate the application of three machine learning models (including generalized linear model, artificial neural network, random forest) to predict the vulnerability of dust cent...
متن کاملUsing multivariate generalized linear latent variable models to measure the difference in event count for stranded marine animals
BACKGROUND AND OBJECTIVES: The classification of marine animals as protected species makes data and information on them to be very important. Therefore, this led to the need to retrieve and understand the data on the event counts for stranded marine animals based on location emergence, number of individuals, behavior, and threats to their presence. Whales are g...
متن کاملThe BUGS Book: A Practical Introduction to Bayesian Analysis
This long-awaited text by the developers of BUGS, the most widely used software for Bayesian data analysis, provides a thorough description of BUGS and how to use it for Bayesian modeling. The first four chapters provide a introduction to Bayesian inference, the BUGS language, and the ideas behind Markov chain Monte Carlo (MCMC) methods. After this introduction, prior distributions are discusse...
متن کاملQSAR models to predict physico-chemical Properties of some barbiturate derivatives using molecular descriptors and genetic algorithm- multiple linear regressions
In this study the relationship between choosing appropriate descriptors by genetic algorithm to the Polarizability (POL), Molar Refractivity (MR) and Octanol/water Partition Coefficient (LogP) of barbiturates is studied. The chemical structures of the molecules were optimized using ab initio 6-31G basis set method and Polak-Ribiere algorithm with conjugated gradient within HyperChem 8.0 environ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016